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Adaptive identification of coherent states 

Markku P.V. StenbergQ Kevin Pack, and Frank K. Wilhelm 
Theoretical Physics, Saarland University, 66123 Saarbriicken, Germany 

We present methods for efficient characterization of an optical coherent state |a). We choose mea¬ 
surement settings adaptively and stochastically, based on data while it is collected. Our algorithm 
divides the estimation into two distinct steps: (i) before the first detection of a vacuum state, the 
probability of choosing a measurement setting is proportional to detecting vacuum with the setting, 
which makes using too similar measurement settings twice unlikely; and (ii) after the first detection 
of vacuum, we focus measurements in the region where vacuum is most likely to be detected. In 
step (i) [(ii)] the detection of vacuum (a photon) has a significantly larger effect on the shape of the 
posterior probability distribution of a. Compared to nonadaptive schemes, our method makes the 
number of measurement shots required to achieve a certain level of accuracy smaller approximately 
by a factor proportional to the area describing the initial uncertainty of a in phase space. While 
this algorithm is not directly robust against readout errors, we make it such by introducing repeated 
measurements in step (i). 

PACS numbers: 42.50.Ct, 42.50.Ar, 42.50.Dv, 03.65.Wj 


I. INTRODUCTION 

A fundamental task in quantum optics is the recon¬ 
struction of the state of the light field, the complete 
description of which is contained in the density matrix. 
Close to the boundary between quantum and classical re¬ 
gions the density matrix is conveniently studied in phase 
space representation, e.g., through the Wigner function 
P], since this allows the investigation of the transition 
between the two regimes. There is a wealth of tech¬ 
niques EHa to measure the Wigner function, most of 
which require many copies of the state and many mea¬ 
surement shots. Producing many copies, however, is not 
always possible or efficient. In systems just crossing the 
classical-to-quantum boundary such as nanomechanical 
resonators [6, 7J, for example, data can be so noisy and 
prone to drift that the required number of experimental 
runs with nominally identical parameters is not possible. 

Often, this laborious task of full quantum state tomog¬ 
raphy is also asking a too general or too unspecific ques¬ 
tion. In many cases, it is sufficient to approximate the 
state by estimating a few parameters characterizing it. 
This is the case, e.g., in the simple and prima facie clas¬ 
sical task of measuring both the quadrature amplitude 
|a| and phase arg(a) of a weak ac signal, for example, 
in the microwave range. These signals are represented 

as coherent quantum states \a) = e a - ^=ya n |n), 

with | n) photon number eigenstate with n photons. Esti¬ 
mating coherent states is an important stepping stone for 
the characterization of more complicated quantum states 
of light because many such states, e.g., a Schrodinger 
cat state \a) + | — a) (8], a so-called voodoo cat state 
e~ l 5 |e -2 S a) + \e 1 ^a) — \ — a) [9], and a compass state 
|a) + i\ia) — | — a) — i\ — ia) [TQ], can be presented as 
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superpositions of coherent states. Direct applications of 
quadrature measurements include, for example, cosmic 
microwave background detection m and the search for 
dark matter axions m 

Estimation of an optical phase alone, at fixed coherent 
state amplitude mm, is the basis of many metrologi¬ 
cal applications, e.g., in magnetometry mm, detection 
of gravitational waves CEH2D1, and clock synchronization 
m ■ Phase estimation is also an indispensable compo¬ 
nent of several algorithms in quantum information pro¬ 
cessing [22]. Since it is difficult to define the concept 
of phase measurement for a single mode only, the usual 
approach is to consider two-mode measurements in an 
interferometer (see Fig. 1 in [23]). The ultimate limit set 
by quantum mechanics to the precision of phase measure¬ 
ments is due to complementarity between photon num¬ 
ber and phase. This translates to a so-called Heisenberg 
limit, phase variance scaling as ~ N ~ 2 with N the total 
number of photons that pass through the interferometer. 
Note that inverse phase variance describes the Fisher in¬ 
formation (variance of the score) [24] of the phase esti¬ 
mate. 

Optimal measurements for phase estimation have been 
identified theoretically in emui but not realized experi¬ 
mentally; it is not possible to perform them with pho¬ 
todetections at the output of the interferometer. In¬ 
put states and measurements with an error scaling close 
to optimum (but with a different prefactor) have been 
proposed optimizing adaptively the next measurement 
in the series of measurements [23J 3TJ (local optimiza¬ 
tion). Adaptive measurements have also been designed 
attempting to optimize the whole series of measurements 
[32H35] (global optimization). The input state to the 
interferometer considered in E3113 El ED CEDES], how¬ 
ever, is not separable between photon number eigenstates 
in the output arms of the interferometer and its creation 
is currently an open question. Beating the standard 
quantum limit has, however, been demonstrated using 
simpler entangled input states [36U39] . Here, we are go- 
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ing beyond bare phase estimation in addressing simulta¬ 
neous adaptive estimation of both phase and amplitude 
of a. In terms of density operators, we thus estimate the 
state within a family p a = |<a)(<a|, a G C. 

The most popular approach to quantum state esti¬ 
mation has been maximum likelihood estimation (MLE) 
M- Given measurement settings S = {si,..., sm} and 
corresponding data D = {di,..., it seeks for a 

physical state p that maximizes the likelihood functional 
P(D\p,S) = ntiiP(di\p,Si), with P(di\p,Si) the prob- 
ability to obtain the measurement outcome d{ given the 
state p and measurement setting Si. However, MLE does 
not deliver confidence intervals for the estimates. More¬ 
over, a basic problem with MLE is that it tends to assign 
vanishing values for certain eigenvalues of p |41| . This is 
unreasonable since it is not possible to completely rule 
out some measurement outcomes with a finite amount of 
data. 

More advanced approaches based on Bayesian infer¬ 
ence do not suffer from these shortcomings. Bayesian 
inference techniques have been developed, e.g, for phase 

mmmmmm , state [31135], and Hamiltonian 

[46U5T] estimation. For certain one-parameter estima¬ 
tion problems it is possible to perform local Bayesian 
optimization of the measurement settings analytically 
For a larger number of unknown pa¬ 
rameters, however, finding optimal measurement settings 
adaptively becomes generally analytically intractable. To 
perform the Bayesian updates numerically, a sequential 
Monte-Carlo approach [44ll49lf55] has recently undergone 
a strong development. 

Bayesian reasoning provides a general framework to 
assign a probability distribution for system parameters 
given certain data. The nature of the data, however, de¬ 
pends on the measurement settings chosen by the experi¬ 
menter. In general it is more effective to choose the mea¬ 
surement settings adaptively so that they depend on the 
data that has been collected. Generally, the specific set 
of rules, also called a policy, according to which the mea¬ 
surement setting 8 is to be updated, has to be developed 
separately for each problem at hand. This is true also 
for the recently developed technique called self-guided 
quantum tomography (SGQT) [56]. SGQT searches the 
estimate of a quantum state by making measurements in 
the directions close to the estimate, approaching it as a 
power law as a function of adaptive iteration steps. The 
optimal prefactors in the related power laws (as well as 
certain coefficient added to the base of an exponential), 
however, depend on the problem and parameter region. 


II. EXPERIMENTAL SETUP 

For definiteness, we consider measurement of the mode 
using an ideal vacuum detector [57H6Q] . A vacuum detec¬ 
tor provides a click if there is more than zero photons in 
the cavity but does not give any indication of the photon 
number beyond that. 



Figure 1: Measurement setup. The unknown coherent state 
within the cavity is denoted by \a). The microwave source 
emits a pulse that displaces the coherent state by /3 in phase 
space. The cavity is capacitively coupled to a Josephson pho¬ 
tomultiplier (JPM) that registers one of two possible mea¬ 
surement outcomes: The displaced state is either vacuum or 
a state containing photons. 


Such detectors have been realized in optics in the 
strongly coupled regime. At microwave frequencies, the 
recently realized m Josephson photomultiplier [62, [63] 
has been shown theoretically [58] to reach this ideal vac¬ 
uum detector limit at weak tunneling and long interac¬ 
tion time. In contrast to a standard Mach-Zender inter¬ 
ferometric scheme for phase measurements, photon num¬ 
ber resolving detectors or beam splitters are not needed, 
and there is no entanglement involved. This technology 
is also suited for the measurement of a qubit state and 
should allow better scalability to larger circuits than that 
based on superconducting amplifiers. The latter requires 
a strong auxiliary microwave pump tone that must be 
isolated from the qubit circuitry with bulky cryogenic 
isolators, which with the former technology can be elim¬ 
inated. 

In the measurement with a vacuum detector, different 
points in phase space may be accessed by injecting an 
additional drive pulse to the input signal, cf. Fig. E a p - 
propriately normalized, the drive pulse U(£) displaces the 
coherent state by an amount /3 = — | f in phase 

space, i.e., it turns \a) into \a + /?), hence allowing one 
to measure the Husimi Q-function Q(a) = -(ql\p-r\ql) = 
^|(0|<a + /3)| 2 , with p_p = \ - /3)(-/3\. 

In terms of a positive operator-valued measure 
(POVM), a measurement setting is thus characterised by 
a set of operators M s , with s = f3 given by the pulse 
parameter. The set M s is defined by M s = {M s , d \d G 
{v,p}}, where d indexes the possible measurement out¬ 
comes, a vacuum state or a state containing photons. 
Here, we only need the operators Mp iV = | — /?)(— /3\ and 
p = I - 

In [9] and [64], respectively, the states of a supercon¬ 
ducting resonator and a nanomechanical resonator were 
characterized. The tomographic method used in these 
papers is analogous to the measurement model above, 
since it consists of displacing the resonator state by a 
microwave pulse and then performing a projective mea¬ 
surement of the qubit. In [65] a nonadaptive tomo¬ 
graphic scheme based on semidefinite programming was 
presented for the characterization of NOON states in res¬ 
onators coupled to qubits. This type of measurements 
can be contrasted to probing of the Wigner function by 
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Figure 2: (Color online) Illustration of an adaptive Bayesian 
inference scheme. 


full photon counting with number resolution [66j [67]. 


III. BAYESIAN INFERENCE 


The basic difference between frequentist and Bayesian 
approaches to parameter estimation is that the latter al¬ 
low one to assign an initial prior probability distribution 
P(d) to describe an unknown state |a). It quantifies the 
experimenters a priori conception of the state and its 
uncertainty. 

Once the initial prior is set and given a suitable pol¬ 
icy, one can take advantage of the information con¬ 
tained in the prior in choosing the measurement set¬ 
ting s. The policies used in this paper are described in 
Sec. V. Bayesian inference proceeds by iteratively apply¬ 
ing Bayes’ theorem 


P(a\d, s ) 


P(d|cq s)P(a) 

pW) 


(i) 


as illustrated in Fig. [2] Here P(d|d,s), referred to as 
likelihood, is the probability to obtain data d (here, 
d = v, p) in the state |a), given the measurement set¬ 
ting s. Using the notation of POVM in Sec. II, it is 
related to the Hermitian operators M s ^ through Born’s 
rule P(d\a,s) = Tr (M s ^p a ). The normalization factor 
P(d\s) is obtained by integrating the likelihood over all 
possible states P(d\s) = f P(d\a, s)P(a)da. The prob¬ 
ability distribution P(a\d,s) for |a) given data d and 
measurement setting s is called the posterior. The pos¬ 
terior can be set as the prior for the next measurement 
which allows iterative application of Eq. 0 

P[oi\{dM+i,SM+i),(D,S)\ = 

P(^M+l|<U Sm+i)P\oi\(D, £)] 
f P(d M +i\ot., s M +i)P[a\(D, S)]da' J 
(P, S) = {( dM , sm ) • • •, (di, si)}. (2) 


Once a sufficient amount of data has been collected and 
the posterior is narrow enough, the estimate a is obtained 
from its mean value. The functional form of our likeli¬ 
hood function P(d\a,s) as well the rules to choose the 
measurement settings s are described in Sec. V. 


IV. NUMERICAL METHOD 


The sequential Monte-Carlo method HUGHES] deliv¬ 
ers an efficient numerical method to perform the updates 
of the posterior. The posterior is approximated by keep¬ 
ing track of its value in iV p moving grid points, or “parti¬ 
cles,” P[a\(D,S)] ~ J2n=i w n S(a — a n ). Herea n are the 
locations of the particles while w n are their relative prob¬ 
abilities or weights that can be updated through Bayes’ 
theorem, 


UV n 

(m+1) 

UJ n 


P(d m+1 \a n ,s n )w { ™\ 

r (m+1) 


+- 


Wn 


7 r/ m+1 ) ’ 

Z-m=l w n 


( 3 ) 

( 4 ) 


Here, are the weights evaluated after the mth 

measurement shot. Equation 0 ensures the norming 
J2n= l w n — 1 and the conservation of probability. 

In the following, the key quantities are the mean a and 
the covariance matrix Cov(d) of a over the posterior. 
Numerically, these are readily approximated through 


a = / P(a\D)ada ~ w n 

n =1 

Cov(o') = J P(a\D)aa T da 


( 5 ) 


~ ~T 

aa 


E 

n= 1 


Pdn d n dm dd 


a = 


/Re(a)\ 


, a T = [Re(a), Im(a)]. 


(6) 


A fixed grid would limit the achievable precision of the 
estimate, but we make use of an adaptive grid 0E2- 
ISS] which makes it possible to focus the particles in the 
regions where the probability distribution concentrates. 
Here, 7V P locations a n are first chosen following the dis¬ 
crete probability distribution {w n }n=i- The particles are 
then assigned new locations a' n by sampling from the nor¬ 
mal distribution 


a n ~ J^\p n , (1 - a 2 )Cov(a)], (7) 

with the mean 


Hi = aa n + (1 — a) a (8) 

and the covariance matrix (1 — a 2 )Cov(d). Here, a is 
a parameter that we set to a = 0.999 95. Finally all 
the weights are set to w n = The artificial dynamics 
induced by Eqs. 0 and Q conserves by construction 
the covariance matrix Cov(a). 
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Figure 3: (Color online) Exemplary evolution of the particles 
(see text) describing the probability distribution of the un¬ 
known coherent state |a) in phase space, (a) Uniform initial 
prior within the disk \a\ < Ro . Here, Ro = 10. (b) Posterior 
after ten measurement shots. The “holes” at positions {c/} 
have been created by photon detections with pulse parameters 
[3 = — a . (c) Posterior after the first detection of a vacuum 
state (here the 11th measurement shot). The weight of the 
posterior is concentrated near a m —/3±, with /3i the pulse 
parameter corresponding to the first vacuum detection. 


In our calculations we choose 7V P = 50 000. To com¬ 
pare different policies, we apply them on 15 000 sim¬ 
ulated samples with randomly chosen true values a. 
We choose a from a uniform distribution on the region 
\a\ < Ro , with Ro = 10 [see Fig. J3^a)]. Note that Ro is 
related to the maximum expectation value of the photon 
number operator through Rq = (n) max . The initial prior 
was chosen to coincide with the aforementioned proba¬ 
bility distribution. While here, this prior exactly incor¬ 
porates what is known about estimated quantities before 
data collection, it has to be noted that in an actual ex¬ 
perimental situation there is no unique and objective way 
to assign the initial prior, but choosing it necessarily in¬ 
volves certain arbitrary or subjective elements. 


V. POLICIES 
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Figure 4: (Color online) Illustration of the policy of Eq. (12). 
The vertical and horizontal axes describe the number of vac¬ 
uum detections and the number of measurement shots per¬ 
formed after the first vacuum detection, respectively. Differ¬ 
ent regions (i)-(iv) correspond the actions on the first to the 
fourth row of Eq. (12), respectively. 


measurements with similar values of (3 unlikely. After 
the first detection of vacuum we adjust the support of 
Pp in the proximity of a « —/?i, with (3\ the measure¬ 
ment setting with which vacuum is detected. Before the 
first vacuum detection, the measurements are thus rel¬ 
atively uninformative, whereas most of the vacuum de¬ 
tections take place within a region with a radius 0(1) in 
phase space (see below). Hence, compared to nonadap- 
tive schemes, focusing the adaptive measurements in the 
correct region makes the number of required measure¬ 
ment shots smaller approximately by a factor rsj Ro or 
the area describing the initial uncertainty of a in phase 
space. 

Specifically, we choose the measurement settings (3 ac¬ 
cording to the following policy [68] 


Pp = 


f P[-P\{D,S)] if (7 = 0, 

< ^2(bw a for ^ + “I < r ( C ) R * 

0 otherwise if C > 1. 


if C > 1, 


( 10 ) 


Our measurement setting s is defined by the pulse pa¬ 
rameter [3. We consider an ideal vacuum detector with 
the likelihood functions for the detections of vacuum (v) 
and a state containing photons (p), respectively 

P(d = v| a, /?) = Tr(p a M^ v ) = e - * 2 , x = \a + /3\ 

P(d = p|a, (3) = Tr(p a M^ 5 p) = 1 - P(d = v|a, (3). (9) 

In the beginning of the experiment, the first detection of a 
vacuum state narrows the posterior P{a\D) significantly 
more than detection of a photon. In the latter case the 
posterior only changes in the proximity of a ~ — /?, where 
its value considerably decreases [see Fig.^b)]. However, 
in the former case the weight of the posterior is con¬ 
centrated near a ~ —/?, while outside this region the 
posterior is exponentially decreased [see Fig. [3^c)]. We 
therefore start the experiment by choosing the displace¬ 
ment pulse (3 randomly from a probability distribution Pp 
such that P—p = P[/3\(D,S)] equals the posterior (here 
the argument a has to be replaced by (3). This makes 


Here, C is the number of measurement shots that have 
detected the vacuum state, a is the current mean of the 
posterior, and R a describes the width of the region where 
the weight of the likelihood function is concentrated. 

More precisely, we define R a = ^Tr[Cov(<a)] + with 

Cov(ci') the covariance matrix for the Bayesian probabil¬ 
ity distribution of a [see Eq. ©• After the first detec¬ 
tion of vacuum, Pp is chosen to be a uniform probability 
distribution on a disk with a radius r(C)R a . We have 
carried out extensive numerical calculations to search an 
optimal r(C) in the form of a power law 

r(C) = aC b , (11) 


with a and b constants. The pair (a, b) £ R 2 parametrizes 
the space within which we search for a near-optimal pol¬ 
icy. 

Since with a single vacuum detection the posterior con¬ 
centrates near a ~ —/?, the policy (10) is not robust 
against readout errors in the experiment. However, it can 
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be made robust against such errors by confirming that an 
absence of a detector click is due to vacuum state rather 
than a readout error. This can be achieved through repe¬ 
tition. In the presence of readout errors we search policies 
of the form (see Fig. [4| 


f Pp = P[-l3\(D,S)] if C = 0, 

P = P U M'^M' + 1 if C > 1, Af' < M^ ax , 

< p = p u M' -> 0, C -> 0 if C t > C > 1, M' = M^ ax , (12) 
Pp = vr i(c)R% for !/? + «!< r(C)R a , 0 otherwise 

if C>C t ,M' = M^. 


Here, the measurements are repeated M(^ ax times at the 
setting Pi that indicates vacuum state (possibly a read¬ 
out error). The variable M' counts the measurement 
shots performed after the vacuum detection. Should af¬ 
ter M(^ ax shots the number of vacuum detections C be 
less than the threshold value C*, the variables M' and C 
are set back to value 0. If after M^ ax shots C is greater 
or equal to the threshold value Ct, the remaining mea¬ 
surement settings are chosen as in policy (10). Similarly 


as in policy ( |Tq| ) , we look for an optimal policy with r(C) 
of the form Eq. ( [IT] ). Here, we set M(^ ax = 39, C t = 15. 
In our computations we assume that the probability of 
misidentifying a vacuum state as a photon state and vice 
versa is P e = 0.1. 


VI. RESULTS 

In the absence of readout errors, we find that for an 
optimal policy, a <C 1 and r only weakly depends on 
C (see Fig. [5]). For instance, for the policy V that 
minimizes the median of the normalized squared error 
2|a t rue — d| 2 /i^ after 10 5 measurement shots, we find 
a = 0.04 and b = 0.05. Here, the width of the plateau 
obtained with a low number of measurement shots de¬ 
pends on the degree of the initial parameter uncertainty 
or the radius Rq. By a crude estimate, one expects that 
the first detection of a vacuum state takes place after 

^ O(10 2 ) measurement shots and that the error is 
then rapidly reduced to 0(1), corresponding to the width 
of the likelihood function [cf. Fig. [3](c)] . This is consis¬ 
tent with the fact that approximately after 50 measure¬ 
ment shots the plateau shape of the curves crosses over 
to a rapid decrease. Up to the level where the normalized 
median squared error reaches the value , the different 



Figure 5: (Color online) Median of the normalized squared 
error 2|o t rue — a\ 2 /Rq calculated from an ensemble of 15 000 
simulated samples (see text) through the policies of Eq. (10). 
Different curves are for different radii r denoted in the inset. 
Black curve is for r(C) calculated through Eq. (11). 


mately 39 % of the weight of P{d = v|a). One might 
expect that policies attempting to search for the steepest 
slope of the likelihood function, with a ~ 1, would be ef¬ 
fective, but this is not the case. The policies with smaller 
values of a are able to find a crude estimate faster. In 
the region x C 1 the likelihood of detecting a photon 
P(d = p|a) ~ x 2 is quadratically small. Therefore with 
a <C 1, once a crude estimate has been found, most mea¬ 
surement shots detect vacuum and confirm the estimate. 
However, since the relative rate of change 

1 dP{d = pjo) = 2x 2 
P(d = p|a) dx e x2 — 1 x ^ 

increases with decreasing x, the rare detections of pho¬ 
tons can effectively make a distinction between different 
possible values of a in the region x <C 1. The relative 
rate of change above describes how much, due to Bayes’ 
theorem 0 . a detection of a photon changes the rela¬ 
tive posterior probabilities of two possible values, a and 
a ', when \a — a'\ = |Ax| is fixed. Putting all together, 
measurement settings with x C 1 are, somewhat unex¬ 
pectedly, more effective than those with larger values of 
x. 
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curves overlap since until this point only the first line in 
Eq. (10) is executed, and the policy thus does not depend 
on r. The curve shape following the expected first vac¬ 
uum detection (rapid decrease of the error) is universal. 
Interestingly, we find that the curves with different val¬ 
ues of r cross, which means that the globally best policy 
can not be found by local optimization. 

With r = 1, the boundary of the disk mentioned above 
coincides with the steepest slope of the likelihood func¬ 
tion P(d\a) of Eq. 0- Such a disk contains approxi¬ 


Even though for optimal policies we have a < Cl, the 
optimal choice is not r = 0. Indeed, the policy at r = 0 
corresponds to choosing — P equal to the mean of the 
posterior, somewhat similarly with a simple, relatively 
ineffective, policy in the context of bare phase estimation 
where the control phase is chosen to coincide with the 
mean of the posterior (see Eq. (6.2) in [31]). 

Policies where the measurement strategy is changed 
after a certain number of measurement shots have been 
developed for phase |69j and Hamiltonian |48j estimation. 
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Figure 6: (Color online) Median of the normalized squared 
error 2|o t rue — a\ 2 /Ro calculated from an ensemble of 15 000 
simulated samples (see text) through the policies of Eq. (12) 
when the probability of a readout error is P e = 0.1. Different 
curves are for different radii r denoted in the inset. 


On the second line of Eq. (10), rather than on the num¬ 
ber of all the measurement shots, we expect a possible 
dependence on the number of shots that take place after 
the first detection of a vacuum state. We therefore count 
in Eq. (10) the number of shots in which a vacuum state 
is detected. 

In the presence of readout errors, we find that the op¬ 
timal value for a is larger than in the absence of these 
errors and the settings are therefore more spread around 
their mean. The dependence of r on C should still be 
weak so that \b\ <C 1 (see Fig. [6|. For the policy V e 
that minimizes the relative median squared error after 
10 5 measurement shots, we find a = 1.0 and 6 = 0. 

For each simulated ensemble we obtain some samples 
that we refer to as “outliers” for which the error signifi¬ 
cantly exceeds the median and the width of the posterior 
probability distribution. Our policies can be made ro¬ 
bust against such outliers through repetition as follows. 
After 10 000 measurement shots we set the prior back to 
the initial prior. We thereafter perform another 10 000 
measurement shots. We then compare the estimates after 
10 000 and 20 000 measurement shots. If their difference 
is smaller than a set threshold, we conclude that we have 
found a correct estimate, otherwise we start a new search 
of the estimate. For the new search we choose a prior that 
again coincides with the original prior. Tables I and II 
summarize the performance of the policies V and V e in 
the absence and presence of readout errors, respectively. 
These correspond to the policies V and V e supplemented 
with the outlier correction scheme. Outliers are defined 
as the samples with the squared error | a — a t rue| larger 
than threshold £ 2 . The outlier correction scheme appears 


to eliminate the outliers with an acceptable overhead. 

Table I: Number of outliers per 10 000 simulated samples with 
P e — 0 for the policy V (see text). Rows correspond to the 
number of outliers with normalized squared error larger than 
S 2 after a given number of measurement shots (indicated by 
the columns). 


£ 2 /Shots 2 x 10 4 

4 x 10 4 

6 x 10 4 

8 x 10 4 

1.2 x 10 5 

1.4 x 10 5 

10“ b 

8410 

2062 

504 

118 

5 

0 

nr 4 

2218 

173 

3 

0 

0 

0 

nr 3 

207 

105 

2 

0 

0 

0 


Table II: Number of outliers per 10 000 simulated samples 
with P e = 0.1 for the policy V e (see text). Rows correspond 
to the number of outliers with normalized squared error larger 
than S 2 after a given number of measurement shots (indicated 
by the columns). 


£ 2 /Shots 2 x 10 4 

4 x 10 4 

6 x 10 4 

8 x 10 4 

1.4 x 10 b 

2.2 x 10 b 

10 _b 

9283 

3474 

1293 

496 

27 

0 

10“ 4 

5075 

70 

1 

0 

0 

0 

10“ 3 

825 

42 

0 

0 

0 

0 


VII. DISCUSSION 

Based on Bayesian inference, we have delivered pow¬ 
erful methods for adaptive characterization of coherent 
states. For larger photon numbers (ft) 1, the adap¬ 
tive schemes discussed here reduce the number of mea¬ 
surement shots required to achieve a certain level of ac¬ 
curacy by a factor proportional to the area describing 
the initial uncertainty of a in phase space. In terms 
of measurement shots, we expect that efficiency can be 
quite generally improved by several orders of magnitude 
which motivates making experiments adaptively. For 
more complicated quantum states of light in a superposi¬ 
tion |T) = J2n=i \ a n), our results are readily applicable 
for estimating {\a n )}n!Li- For full identification of |T), 
our policies have to be supplemented with methods to ob¬ 
tain relative weights {\a n \ 2 }n=i °f different components 
as well as their relative phases. Our work thus constitutes 
a building block that opens up an avenue for efficient 
estimation of multicomponent Schrodinger-cat states of 
light. 


ACKNOWLEDGMENTS 

We acknowledge L. C. G. Govia, E. M. Leonard Jr., R. 
McDermott, I. Pechenezhskiy, G. J. Ribeill, S. F. Taylor, 
and T. Thorbeck for discussions. This work was sup¬ 
ported by the European Union through ScaleQIT. 





















7 


[1] E. Wigner, Phys. Rev. 40, 749 (1932). 

[2] D. T. Smithey, M. Beck, M. G. Raymer, and A. Faridani, 
Phys. Rev. Lett. 70, 1244 (1993). 

[3] L. G. Lutterbach and L. Davidovich, Phys. Rev. Lett. 
78, 2547 (1997). 

[41 A. I. Lvovsky and S. A. Babichev, Phys. Rev. A 66, 
011801 (2002). 

[5] P. Bertet, A. Auffeves, P. Maioli, S. Osnaghi, T. Meunier, 

M. Brune, J. M. Raimond, and S. Haroche, Phys. Rev. 
Lett. 89, 200402 (2002). 

[6] A. N. Cleland and M. R. Geller, Phys. Rev. Lett. 93, 
070501 (2004). 

[7] J. Bochmann, A. Vainsencher, D. D. Awschalom, and A. 

N. Cleland, Nature Phys. 9, 712 (2013). 

[8] S. Deleglise, I. Dotsenko, C. Sayrin, J. Bernu, M. Brune, 
J.-M. Raimond, and S. Haroche, Nature (London) 455, 
510 (2008). 

[9] M. Hofheinz et al. , Nature (London) 459, 546 (2009). 

[10] G. Kirchmair, B. Vlastakis, Zaki Leghtas, S. E. Nigg, 
H. Paik, E. Ginossar, M. Mirrahimi, L. Frunzio, S. M. 
Girvin, and R. J. Schoelkopf, Nature (London) 495, 205 
(2013). 

[11] P. K. Day, H. G. DeDuc, B. A. Mazin, A. Vayonakis, and 
J. Zmuidzinas, Nature (London) 425, 817 (2003). 

[12] R. Bradley et al ., Rev. Mod. Phys. 75, 777 (2003). 

[131 V. Giovannetti, S. Lloyd, and L. Maccone, Science 306, 
1330 (2004). 

[14] D. W. Berry, B. L. Higgins, S. D. Bartlett, M. W. 
Mitchell, G. J. Pryde, and H. M. Wiseman, Phys. Rev. 
A 80, 052114 (2009). 

[15] G. Waldherr et a/., Nat. Nanotechnol. 7, 105 (2012). 

[16] N. M. Nusran, M. U. Momeen, and M. V. G. Dutt, Nat. 
Nanotechnol. 7, 109 (2012). 

[17] A. J. F. Hayes and D. W. Berry, Phys. Rev. A 89, 013838 
(2014). 

[18] C. M. Caves, Phys. Rev. D 23, 1693 (1981). 

[19] A. Abramovici et al. , Science 256, 325 (1992). 

[20] K. Goda et al. , Nature Phys. 4, 472 (2008). 

[21] M. de Burgh and S. D. Bartlett, Phys. Rev. A 72, 042301 
(2005). 

[22] M. A. Nielsen and I. L. Chuang, Quantum Computation 
and Quantum Information , (Cambridge University Press, 
Cambridge, UK, 2000). 

[23] D. W. Berry and H. M. Wiseman, Phys. Rev. Lett. 85, 
5098 (2000). 

[24] E. L. Lehmann and G. Casella, Theory of Point Estima¬ 
tion (Springer, New York, 1998). 

[25] A. S. Holevo, Rep. Math. Phys. 16, 385 (1979). 

[26] B. C. Sanders and G. J. Milburn, Phys. Rev. Lett. 75, 
2944 (1995). 

[27] B. C. Sanders, G. J. Milburn, and Z. Zhang, J. Mod. 
Opt. 44, 1309 (1997). 

[28] A. Luis and J. Perina, Phys. Rev. A 54, 4564 (1996). 

[29] V. Buzek, R. Derka, and S. Massar, Phys. Rev. Lett. 82, 
2207 (1999). 

[30] H. Imai and M. Hayashi, New J. Phys. 11, 043034 (2009). 

[31] D. W. Berry, H. M. Wiseman, and J. K. Breslin, Phys. 
Rev. A 63, 053804 (2001). 

[32] A. Fujiwara, J. Phys. A: Math. Theor. 39, 12489 (2006). 

[33] A. Hentschel and B. C. Sanders, Phys. Rev. Lett. 104, 
063603 (2010). 


[34] M. Hayashi, Commun. Math. Phys. 304, 689 (2011). 

[35] A. Hentschel and B. C. Sanders, Phys. Rev. Lett. 107, 
233601 (2011). 

[36] B. L. Higgins et al. , Nature (London) 450, 393 (2007). 

[37] T. Nagata, R. Okamoto, J. L. O’Brien, K. Sasaki, and S. 
Takeuchi, Science 316, 726 (2007). 

[38] R. Okamoto, H. F. Hofmann, T. Nagata, J. L. O’Brien, 
K. Sasaki, and S. Takeuchi, New J. Phys. 10, 073033 
(2008). 

[39] J. A. Jones, S. D. Karlen, J. Fitzsimmons, A. Ardavan, 
S. C. Benjamin, G. A. D. Briggs, and J. J. L. Morton, 
Science 324, 1166 (2009). 

[40] D. F. V. James, P. G. Kwiat, W. J. Munro, and A. G. 
White, Phys. Rev. A 64, 052312 (2001). 

[41] R. Bloume-Kohout, New J. Phys. 12, 043034 (2010). 

[42] H. M. Wiseman, Phys. Rev. Lett. 75, 4587 (1995). 

[43] M. A. Armen, J. K. Au, J. K. Stockton, A. C. Doherty, 
and H. Mabuchi, Phys. Rev. Lett. 89, 133602 (2002). 

[44] F. Huszar and N. M. T. Houlsby, Phys. Rev. A 85, 
052120 (2012). 

[45] K. S. Kravtsov, S. S. Straupe, I. V. Radchenko, N. M. T. 
Houlsby, F. Huszar, and S. P. Kulik, Phys. Rev. A 87, 
062122 (2013). 

[46] S. G. Schirmer and D. K. L. Oi, Phys. Rev. A 80, 022333 
(2009). 

[47] A. Sergeevich, A. Chandran, J. Combes, S. D. Bartlett, 
and H. M. Wiseman, Phys. Rev. A 84, 052315 (2011). 

[48] C. Ferrie, C. E. Granade, and D. G. Cory, Quant. Inf. 
Proc. 12, 611 (2013). 

[49] N. Wiebe, C. E. Granade, C. Ferrie, and D. G. Cory, 
Phys. Rev. Lett. 112, 190501 (2014). 

[50] N. Wiebe, C. E. Granade, C. Ferrie, and D. G. Cory, 
Phys. Rev. A 89, 042314 (2014). 

[51] M. P. V. Stenberg, Y. R. Sanders, and F. K. Wilhelm, 
Phys. Rev. Lett. 113, 210404 (2014). 

[52] M. West, J. Roy Stat. Soc. B 55, 409 (1993). 

[53] N. J. Gordon, D. J. Salmond, and A. F. M. Smith, Radar 
and Signal Processing IEE Proc. F 140, 107 (1993). 

[54] J. Liu and M. West, in Sequential Monte Carlo Meth¬ 
ods in Practice , edited by A. Doucet, N. Freitas, and N. 
Gordon (Springer, New York, 2001). 

[55] C. E. Granade, C. Ferrie, N. Wiebe, and D. G. Cory, 
New J. Phys. 14, 103013 (2012). 

[56] C. Ferrie, Phys. Rev. Lett. 113, 190404 (2014). 

[57] J. Sperling, W. Vogel, and G. S. Agarwal, Phys. Rev. A 
85, 023820 (2012). 

[58] L. C. G. Govia, E. J. Pritchett, S. T. Merkel, D. Pineau, 
and F. K. Wilhelm, Phys. Rev. A 86, 032311 (2012). 

[59] L. C. G. Govia, E. J. Pritchett, and F. K. Wilhelm, New 
J. Phys. 16, 045011 (2014). 

[60] D. K. L. Oi, V. Potocek, and J. Jeffers, Phys. Rev. Lett. 
110, 210504 (2013). 

[61] Y.-F. Chen, D. Hover, S. Sendelbach, L. Maurer, S. T. 
Merkel, E. J. Pritchett, F. K. Wilhelm, and R. McDer¬ 
mott, Phys. Rev. Lett. 107, 217401 (2011). 

[62] A. Poudel, R. McDermott, and M. G. Vavilov, Phys. Rev. 
B 86, 174506 (2012). 

[63] B. Peropadre, G. Romero, G. Johansson, C. M. Wilson, 
E. Solano, and J. J. Garcfa-Ripoll, Phys. Rev. A 84, 
063834 (2011). 

[64] A. D. O’Connell et al. , Nature (London) 464, 697 (2010). 



[65] S. T. Merkel and F. K. Wilhelm, New J. Phys. 12, 093036 

( 2010 ). 

[66] K. Banaszek and K. Wodkiewicz, Phys. Rev. Lett. 76, 
4344 (1996). 

[67] K. Banaszek, C. Radzewicz, K. Wodkiewicz, and J. S. 
Krasinski, Phys. Rev. A 60, 674 (1999). 

[68] It is unlikely but possible that vacuum is not detected 
even though a large number, Mo, of measurement shots 
is carried out. Due to limited numerical accuracy, this can 


lead to a drastic failure of the estimation if the particles 
do not converge close to the true value. Therefore, in case 
vacuum has not been detected after Mo measurement 
shots, we switch the first line of Eq. (2) to a uniform 
probability distribution Pp — ^2 on the region (3 < Ro- 

We set Mo = 600 and Ro — 10. 

[69] M. W. Mitchell, Proc. SPIE 5893, 589310 (2005). 



